BIOTECHNOLOGY ^ cs» S 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Date Processed by STIC: ^'^H/qI 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703-308-4212; FAX: 703-308-4221 
Effective 12/13/03 : TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.1 PROGRAM ACCESSIBLE THROUGH THE U S PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 

http://www.usDto.gov/web/offices/Dac/checker/chkr41note.htm 



Applicants submitUng geneUc sequence information electronically on diskette or CD-Rom should be aware that tliere 

a possibility that tlie disk/CD-Roni may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for tlie disk/CD-Rom or replacement disk/CD-Rom. 

Any re p l y including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Offi ce, and instead should be sent via the following to the indicated addresses: 

1 EFS-Bio (<http://www.u spto.gov/ebc/efs/downloads/documents.htm> . EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry directly to (EFFECTIVE 12/01/03): 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two 
20 1 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 Soutli Clark Place, Arlington, VA 22202 ' 

Revised 10/08/03 




RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/864,486 



DATE: 12/05/2003 
TIME: 12:39:17 



ICL 



Input Set : A:\PTO,PG.t:xt 

Output Set: N:\CRF4\12052003\l864486.raw 

^ ^H(ritimm 

<110> APPLICANT: CENTER FOR GENETIC^INGENIERING^AND BIOTHECNOLOGY 

<120> TITLE OF INVENTION: DNA fragments of the methilotrophyc Pichia pastoris yeast 
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gene 

<130> FILE REFERENCE: List of Sequences (English) 
<140> CURRENT APPLICATION NUMBER: US/09/864,486 
<141> CURRENT FILING DATE: 2001-05-24 

<160> NUMBER OF SEQ ID NOS : 10 
<17 0> SOFTWARE: Patent In Ver. 2,1 
■<210> SEQ ID NO: 1 
<211> LENGTH: 684 
<212> TYPE: DNA 

<213> ORGANISM: Pichia pastoris 
<220> FEATURE: 
<221> NAME/KEY: promoter 
<222> LOCATION: (1)..(684) 

<223> OTHER INFORMATION: Sequence that contains the 5' regulating region of ICL gene. 
<4 00> SEQUENCE: 1 

gaattcggac aaatgtgctg ttccggtagc ttgtaggaag cggcatccgt agggcaatat 60 - . 

acgactatag cttctaaagc gtagtacaat gaaatgttcg aaggaacae^ aaacggattt 120 '3 tdU-4( 

gtttttcgta ggctcaacq^ gttgaggtgt aactctttag ^aaagggta agattgattg 180 ^ 
ttcgaagtag gggcctcaaa gggaaagaga aaaaaaaaaa tacaccgaag agttacgtaa 24 0 is %j;.JuuU)JL^ 
gcatatattt tttacgtaaa gcatgattga atttcagcag tattgtttaa caaggctgat 300 "^^^ ^TT*''^-*^ 
gtcgtortgcc aatcaaaaca aaagagattc gcataatgcc ataattgggg tgtgtggg^ 360 / 
(ncccc^aaa cgtctttctc atcatcatct gcaaccccca tcgaacctca ttaaatcaca 420 f ^je /) 
tgacttgtgc gatcctcggt caactcgttc cgtgcaccca ttccaccccg ggctgaccaa 480 V ^^^^ f' 
cgcaaggttc tccgagagtc cgctacccca gatttatatc agcaaccagt cacctttttc 54 0 J a J 
cgggcacgac tctatatgcc ctggaaaacc ggagacgatg agcctgacta taaaaggtga 600 -^"/(^ JLAAV^ 
cagaaccccc aactctggtt aatctcttca acaaatactt tattttcttt caattcaaag 660 Q 
aacacagtat caagtatatc aaga 684 
<210> SEQ ID NO: 2 
<211> LENGTH: 360 
<212> TYPE: DNA 

<213> ORGANISM: Pichia pastoris 
<220> FEATURE: 

<221> NAME/KEY: terminator 
<222> LOCATION: (1)..{360) 

<223> OTHER INFORMATION: Sequence that contains the 3' regulating region of ICL gene 
<400> SEQUENCE: 2 

gtaataggag ttcctaagta gttaagataa ttgacttgag gtatttatag atttgtgtgt 60 
aggtaatatc tatggtcgtc cattcttacc ttggtggggt gacggggcgg tgaataaatc 120 
agttgcgatg aaagacttta caaccttgtc accagagggt gcggtctact gattactaca 180 
aacgacttgg ataaaatttt caattcaaaa tcaatataaa aaaaaaaact taacatcact 240 
gatgtttcac taaactcttt aaacgctcaa cctcagcttc caactcgctc ttgcaaatca 300 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/864,486 



DATE: 12/05/2003 
TIME: 12:39:17 



Input Set : A:\PTO.PG.txt 

Output Set: N:\CRF4\12052003\l864486.raw 



58 gtaactcctc aactttgtct tccagttgac tcattctctt catcttctta gccctggaac 360 

61 <210> SEQ ID NO: 3 

62 <211> LENGTH: 21 

63 <212> TYPE: DNA 

64 <213> ORGANISM: Artificial Sequence 

66 <220> FEATURE: 

67 <223> OTHER INFORMATION: Description of Artificial Sequence: 

68 oligonucleotide 
7 0 <4 00> SEQUENCE: 3 

71 tchggwtggc artgytchtc h 21 

74 <210> SEQ ID NO: 4 

75 <211> LENGTH: 21 

76 <212> TYPE: DNA 

77 <213> ORGANISM: Artificial Sequence 
7 9 <220> FEATURE: 

80 <223> OTHER INFORMATION: Description of Artificial Sequence: 

81 oligonucleotide , 



88 <211> LENGTH: 27 

89 <212> TYPE: DNA 

90 <213> ORGANISM: Artificial Sequence 

92 <220> FEATURE: 

93 <223> OTHER INFORMATION: Description of Artificial Sequence: 

94 oligonucleotide 

96 <400> SEQUENCE: 5 

97 ctgcaggaat taattcgcct tagacat 27 

100 <210> SEQ ID NO: 6 

101 <211> LENGTH: 26 

102 <212> TYPE: DNA ; 

103 <213> ORGANISM: Artificial Sequence 

105 <220> FEATURE: 

106 <223> OTHER INFORMATION: Description of Artificial Sequence: 

107 oligonucleotide 

109 <400> SEQUENCE: 6 

110 aagcttgcgt taacgaatct agaact 26 

113 <210> SEQ ID NO: 7 

114 <211> LENGTH: 23 

115 <212> TYPE: DNA 

116 <213> ORGANISM: Artificial Sequence 

118 <220> FEATURE: 

119 <223> OTHER INFORMATION: Description of Artificial Sequence: 

120 oligonucleotide 

122 <4 00> SEQUENCE: 7 

123 cctcgagctt gtaggaattc gca 23 

126 <210> SEQ ID NO: 8 

127 <211> LENGTH: 24 

128 <212> TYPE: DNA 




21 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/864,486 



DATE: 12/05/2003 
TIME: 12:39:17 



Input Set : A:\PTO.PG.txt 

Output Set: N:\CRF4\12052003\I864486.raw 



129 <213> ORGANISM: Artificial Sequence 

131 <220> FEATURE: 

132 <223> OTHER INFORMATION: Description of Artificial Sequence: 



139 <210> SEQ ID NO: 9 

140 <211> LENGTH: 24 

141 <212> TYPE: DNA 

142 <213> ORGANISM: Artificial Sequence 
14 4 <220> FEATURE: 

145 <223> OTHER INFORMATION: Description of Artificial Sequence: 
14 6 oligonucleotide 

14 8 <400> SEQUENCE: 9 

14 9 atgctagcgc aagctttcct tttc 24 

152 <210> SEQ ID NO: 10 

153 <211> LENGTH: 24 

154 <212> TYPE: DNA 

155 <213> ORGANISM: Artificial Sequence 

157 <220> FEATURE: 

158 <223> OTHER INFORMATION: Description of Artificial Sequence: 

159 oligonucleotide 

161 <400> SEQUENCE: 10 

162 aoctcacqat naoc.t Pi^^it r.t cjnnf^ 24 



133 
135 
136 



oligonucleotide 
<400> SEQUENCE: 8 
aaggtgctag cattcttgat atac 



24 
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VARIABLE LOCATION SUMMARY DATE: 12/05/2003 

^ PATENT APPLICATION: US/09/864,486 TIME: 12:39:18 

Input Set : A:\PTO.PG.txt 
i Output Set: N:\CRF4\12052003\l864486.raw 

Use of n's or Xaa's(NEW RULES) : JJ\JJAj J^UaJj^-^^^^^^ 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Use of <220> to <223> is MANDATORY if n's or Xaa's are present, 

in <220> to <223> section, please explain location of n or Xaa, and which 

residue n or Xaa represents. 



Seq#:l; N Pos . 110,140,161,359,361,366 
Seq#:4; N Pos. 3 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/864,486 



DATE: 
TIME: 



12/05/2003 
12:39:18 



Input Set : A:\PTO.PG.txt 

Output Set: N:\CRF4\12052003\l864486.raw 



L:ll M:270 C: Current Application Number differs, Replaced Current Application Number 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:30 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos,:60 
M:341 Repeated in SeqNo=l 

L:47 M:283 W: Missing Blank Line separator, <220> field identifier 
L:84 M:258 W: Mandatory Feature missing, <221> Tag not found for SEQ ID#:4 
L:84 M:258 W: Mandatory Feature missing, <222> Tag not found for SEQ ID#:4 
L:84 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos . : 0 

L:168 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 3 

L:169 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 10 

L:169 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 3 
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